Picture for Shuang Qiu

Shuang Qiu

University of Michigan, Ann Arbor

Where to Refine, When to Stop: Rethinking Redundancy via Latent Discrepancy for Efficient Visual Autoregressive Generation

Add code
May 29, 2026
Viaarxiv icon

Unifying Value Alignment and Assignment in Cross-Domain Offline Reinforcement Learning with Heterogeneous Datasets

Add code
May 24, 2026
Viaarxiv icon

Reference-Sampled Boltzmann Projection for KL-Regularized RLVR: Target-Matched Weighted SFT, Finite One-Shot Gaps, and Policy Mirror Descent

Add code
May 04, 2026
Viaarxiv icon

AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly

Add code
Apr 10, 2026
Viaarxiv icon

Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting

Add code
Mar 09, 2026
Viaarxiv icon

Deep Dense Exploration for LLM Reinforcement Learning via Pivot-Driven Resampling

Add code
Feb 15, 2026
Viaarxiv icon

SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration

Add code
Feb 04, 2026
Viaarxiv icon

USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots

Add code
Oct 09, 2025
Figure 1 for USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots
Figure 2 for USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots
Figure 3 for USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots
Figure 4 for USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots
Viaarxiv icon

LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Add code
Sep 05, 2025
Figure 1 for LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
Figure 2 for LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
Figure 3 for LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
Figure 4 for LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
Viaarxiv icon

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models

Add code
May 29, 2025
Viaarxiv icon